Combining Speech Retrieval Results with Generalized Additive Models

Authors

  • J. Scott Olsson
  • Douglas W. Oard
Abstract

Rapid and inexpensive techniques for automatic transcription of speech have the potential to dramatically expand the types of content to which information retrieval techniques can be productively applied, but limitations in accuracy and robustness must be overcome before that promise can be fully realized. Combining retrieval results from systems built on various errorful representations of the same collection offers some potential to address these challenges. This paper explores that potential by applying Generalized Additive Models to optimize the combination of ranked retrieval results obtained using transcripts produced automatically for the same spoken content by substantially different recognition systems. Topic-averaged retrieval effectiveness better than any previously reported for the same collection was obtained, and even larger gains are apparent when using an alternative measure emphasizing results on the most difficult topics.

Similar resources

Combining Evidence from Unconstrained Spoken Term Frequency Estimation for Improved Speech Retrieval

Dissertation by J. Scott Olsson (Doctor of Philosophy, 2008), directed by Associate Professor Douglas W. Oard, College of Information Studies. This dissertation considers the problem of information retrieval in speech. Today’s speech retrieval systems generally use a large vocab...

Combining Multiple Models for Speech Information Retrieval

In this article we present a method for combining different information retrieval models in order to increase the retrieval performance in a Speech Information Retrieval task. The formulas for combining the models are tuned on training data. Then the system is evaluated on test data. The task is particularly difficult because the text collection is automatically transcribed spontaneous speech, ...

Optimal Multi-microphone Speech Enhancement in Cars

Hands-free speech telephony and speech recognition in cars suffer from additive noise and reverberation. We propose an iterative blind channel estimation algorithm based on an analysis-by-synthesis loop closed around a multipath Generalized Sidelobe Canceller (GSC). By combining a post-filter with the proposed scheme, optimal speech enhancement in practical situations can be achieved. The algor...

Applying the Generalized Additive Model to Determine the Type of Relationship Among Retinopathy Risk Factors in Diabetic Patients in Tehran

Background: One of the most important complications of diabetes is diabetic retinopathy, which causes blindness in 10,000 people every year. Many studies have examined retinopathy risk factors in diabetic patients. This study was carried out to determine the type of relationship between retinopathy risk factors and developing the condition, using generalized additive models. T...

Missing-feature reconstruction for band-limited speech recognition in spoken document retrieval

In spoken document retrieval, it is necessary to support a variety of audio corpora from sources that have a range of conditions (e.g., channels, microphones, noise conditions, recording media, etc.). Varying band-limited speech represents one of the most challenging factors for robust speech recognition. The missing-feature reconstruction method shows the effectiveness in recognition of the sp...

Publication date: 2008